C/3 



(N 



X 



Interference Mitigation Techniques 
for Clustered Multicell Joint Decoding Systems 

Symeon Chatzinotas, Bjom Ottersten 



O 

(N . 
, Abstract 

Multicell joint processing has originated from information-theoretic principles as a means of reach- 
■^ ' ing the fundamental capacity limits of cellular networks. However, global multicell joint decoding is 

(N : 

highly complex and in practice clusters of cooperating Base Stations constitute a more realistic scenario. 



In this direction, the mitigation of intercluster interference rises as a critical factor towards achieving 
the promised throughput gains. In this paper, two intercluster interference mitigation techniques are 
investigated and compared, namely interference alignment and resource division multiple access. The 



p^ I cases of global multicell joint processing and cochannel interference allowance are also considered as 

> 



an upper and lower bound to the interference ahgnment scheme respectively. Each case is modelled 
and analyzed using the per-cell ergodic sum-rate throughput as a figure of merit. In this process, the 
asymptotic eigenvalue distribution of the channel covariance matrices is analytically derived based on 
free-probabilistic arguments in order to quantify the sum-rate throughput. Using numerical results, it 



o. 

^D . is established that resource division multiple access is preferable for dense cellular systems, while 



cochannel interference allowance is advantageous for highly sparse cellular systems. Interference align- 
ment provides superior performance for average to sparse cellular systems on the expense of higher 



H I complexity. 
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I. Introduction 

Currently cellular networks carry the main bulk of wireless traffic and as a result they risk 
being saturated considering the ever increasing traffic imposed by internet data services. In 
this context, the academic community in collaboration with industry and standardization bodies 
have been investigating innovative network architectures and communication techniques which 
can overcome the interference-limited nature of cellular systems. The paradigm of multicell 
joint processing has risen as a promising way of overcoming those limitations and has since 
gained increasing momentum which lead from theoretical research to testbed implementations 
[[H. Furthermore, the recent inclusion of CoMP (Coordinated Multiple Point) techniques in LTE- 
Advanced BH serves as a reinforcement of the latter statement. 

Multicell joint processing is based on the idea that signal processing does not take place 
at individual Base Stations (BSs), but at a central processor which can jointly serve the User 
Terminals (UTs) of multiple cells through the spatially distributed BSs. It should be noted 
that the main concept of multicell joint processing is closely connected to the rationale behind 
Network MIMO and Distributed Antenna Systems (DAS) and those three terms are often utilized 
interchangeably in the literature. According to the global multicell joint processing, all the BSs of 
a large cellular system are assumed to be interconnected to a single central processor through an 
extended backhaul. However, the computational requirements of such a processor and the large 
investment needed for backhaul links have hindered its realization. On the other hand, clustered 
multicell joint processing utilizes multiple signal processors in order to form BS clusters of 
limited size, but this localized cooperation introduces intercluster interference into the system, 
which has to be mitigated in order to harvest the full potential of multicell joint processing. In this 
direction, reuse of time or frequency channel resources (resource division multiple access) could 
provide the necessary spatial separation amongst clusters, an approach which basically mimics 
the principles of the traditional cellular paradigm only on a cluster scale. Another alternative 
would be to simply tolerate intercluster signals as cochannel interference, but obviously this 
scheme becomes problematic in highly dense systems. 

Taking all this into account, the current paper considers the uplink of a clustered Multicell Joint 
Decoding (MJD) system and proposes a new communication strategy for mitigating intercluster 
interference using Interference Alignment (lA). More specifically, the main contributions herein 



are: 

1) the channel modelling of a clustered MJD system with lA as intercluster interference 
mitigation technique, 

2) the analytical derivation of the ergodic throughput based on free probabilistic arguments 
in the R-transform domain, 

3) the analytical comparison with the upper bound of global MJD, the Resource Division 
Multiple Access (RDMA) scheme and the lower bound of clustered MJD with Cochannel 
Interference allowance (CI), 

4) the comparison of the derived closed-form expressions with Monte Carlo simulations and 
the performance evaluation using numerical results. 

The remainder of this paper is structured as follows: Section [II] reviews in detail prior work in 
the areas of clustered MJD and lA. Section Ull] describes the channel modelling, free probability 
derivations and throughput results for the following cases: a) global MJD, b) lA, c) RDMA and 
d) CI. Section HV] displays the accuracy of the analysis by comparing to Monte Carlo simulations 
and evaluates the effect of various system parameters in the throughput performance of clustered 
MJD. Section |V] concludes the paper. 

A. Notation 

Throughout the formulations of this paper, E[-] denotes expectation, (■) denotes the conjugate 
matrix transpose, (■) denotes the matrix transpose, denotes the Hadamard product and 
denotes the Kronecker product. The Frobenius norm of a matrix or vector is denoted by ||-|| and 
the delta function by 5{-). I„ denotes an nx n identity matrix, I„xm an nx m matrix of ones, 
a zero matrix and Gnxm ~ CA/'(0,I„) denotes nx m Gaussian matrix with entries drawn form 
a CJ\f{0, 1) distribution. The figure of merit analyzed and compared throughout this paper is the 
ergodic per-cell sum-rate throughpuu- 

'The term throughput is used instead of capacity since the described techniques lead to achievable sum-rates except for MJD 
which leads to MIMO MAC capacity. 



II. Related Work 

A. Multicell Joint Decoding 

This section reviews the literature on MJD systems by describing the evolution of global MJD 
models and subsequently focusing on clustered MJD approaches. 

1) Global MJD: It was almost three decades ago when the paradigm of global MJD was 
initially proposed in two seminal papers |[3l, Pl, promising large capacity enhancements. The 
main idea behind global MJD is the existence of a central processor (a.k.a. "hyper-receiver") 
which is interconnected to all the BSs through a backhaul of wideband, delayless and error-free 
links. The central processor is assumed to have perfect Channel State Information (CSI) about all 
the wireless links of the system. The optimal communication strategy is superposition coding at 
the UTs and successive interference cancellation at the central processor. As a result, the central 
processor is able to jointly decode all the UTs of the system, rendering the concept of intercell 
interference void. 

Since then, the initial results were extended and modified by the research community for 
more practical propagation environments, transmission techniques and backhaul infrastructures 
in an attempt to more accurately quantify the performance gain. More specifically, it was 
demonstrated in [5] that Rayleigh fading promotes multiuser diversity which is beneficial for 
the ergodic capacity performance. Subsequently, realistic path-loss models and user distributions 
were investigated in |l6l, |I71 providing closed-form ergodic capacity expressions based on the cell 
size, path loss exponent and geographical distribution of UTs. The beneficial effect of MIMO 
links was established in [[8]|, flUl, where a linear scaling of the ergodic per-cell sum-rate capacity 
with the number of BS antennas was shown. However, correlation between multiple antennas 
has an adverse effect as shown in [ITOl . especially when correlation affects the BS-side. Imperfect 
backhaul connectivity has also a negative effect on the capacity performance as quantified in [1 1]. 
MJD has been also considered in combination with DS-CDMA [12J, where chips act as multiple 
dimensions. Finally, linear MMSE filtering [fT3l . [O followed by single-user decoding has been 
considered as an alternative to the optimal multiuser decoder which requires computationally- 
complex successive interference cancellation. 

2) Clustered MJD: Clustered MJD is based on forming groups of M adjacent BSs (clusters) 
interconnected to a cluster processor. As a result, it can be seen as an intermediate state between 



traditional cellular systems (Af = 1) and global MJD (M = oo). The advantage of clustered 
MJD lies on the fact that both the size of the backhaul network and the number of UTs to be 
jointly processed decrease. The benefit is twofold; firstly, the extent of the backhaul network is 
reduced and secondly, the computational requirements of MJD (which depend on the number 
of UTs) are lower. The disadvantage is that the sum-rate capacity performance is degraded 
by intercluster interference, especially affecting the individual rates of cluster-edge UTs. This 
impairment can be tackled using a number of techniques as described here. The simplest approach 
is to just treat it as cochannel interference and evaluate its effect on the system capacity as in 
[fTSl . An alternative would be to use RDMA, namely to split the time or frequency resources 
into orthogonal parts dedicated to cluster-edge cells [fT6ll . This approach eliminates intercluster 
interference but at the same time limits the available degrees of freedom. In DS-CDMA MJD 
systems, knowledge of the interfering codebooks has been also used to mitigate intercluster 
interference (12]- Finally, antenna selection schemes were investigated as a simple way of 
reducing the number of intercluster interferers ifTTl . 

B. Interference Alignment 

This section reviews the basic principles of lA and subsequently describes existing applications 
of lA on cellular networks. 

1) lA Preliminaries: lA has been shown to achieve the degrees of freedom (dofs) for a range 
of interference channels [fTSl . [fT9ll . [|20ll . Its principle is based on aligning the interference on 
a signal subspace with respect to the non-intended receiver, so that it can be easily filtered out 
by sacrificing some signal dimensions. The advantage is that this alignment does not affect the 
randomness of the signals and the available dimensions with respect to the intended receiver. 
The disadvantage is that the filtering at the non-intended receiver removes the signal energy in 
the interference subspace and reduces the achievable rate. The fundamental assumptions which 
render lA feasible are that there are multiple available dimensions (space, frequency, time or 
code) and that the transmitter is aware of the CSI towards the non-intended receiver. The exact 
number of needed dimensions and the precoding vectors to achieve lA are rather cumbersome 
to compute, but a number of approaches have been presented in the literature towards this end 

lEn, ea, im. 



2) lA and Cellular Networks: lA has been also investigated in the context of cellular networks, 
showing that it can effectively suppress cochannel interference [!24l| . f23\ . More specifically, the 
downlink of an OFDMA cellular network with clustered BS cooperation is considered in f25\, 
where lA is employed to suppress intracluster interference while intercluster interference has to 
be tolerated as noise. Using simulations, it is shown therein that even with unit multiplexing 
gain the throughput performance is increased compared to a frequency reuse scheme, especially 
for the cluster-centre UTs. In a similar setting, the authors in [|26l propose an lA-based resource 
allocation scheme which jointly optimizes the frequency-domain precoding, subcarrier user 
selection, and power allocation on the downlink of coordinated multicell OFDMA systems. 
In addition, authors in [24J consider the uplink of a limited-size cellular system without BS 
cooperation, showing that the interference-free dofs can be achieved as the number of UTs 
grows. Employing lA with unit multiplexing gain towards the non-intended BSs, they study the 
effect of multi-path channels and single-path channels with propagation delay. Furthermore, the 
concept of decomposable channel is employed to enable a modified scheme called subspace lA, 
which is able to simultaneously align interference towards multiple non-intended receivers over 
a multidimensional space. Finally, the effect of limited feedback on cellular lA schemes has 
been investigated and quantified in ll25l . Il27ll . 

III. Channel Model & Throughput Analysis 

In this paper, the considered system comprises a modified version of Wyner's linear cellular 
array flU, lfT2ll . Il28l . which has been used extensively as a tractable model for studying MJD 
scenarios. In the modified model studied herein, MJD is possible for clusters of M adjacent 
BSs while the focus is on the uplink. Unlike [[24|. Il23l . lA is employed herein to mitigate 
intercluster interference between cluster-edge cells. Let us assume that K UTs are positioned 
between each pair of neighboring BSs with path loss coefficients 1 and a respectively (Figure 
[T]). All BSs and UTs are equipped with n = K + 1 antennas to enable lA over the multiple 
spatial dimensions for the clustered UTs. In this setting, four scenarios of intercluster interference 
are considered, namely global MJD, lA, RDMA and CI. It should be noted that only cluster- 
edge UTs employ interference mitigation techniques, while UTs in the interior of the cluster 
use the optimal wideband transmission scheme with superposition coding as in [5J. Successive 
interference cancellation is employed in each cluster processor in order to recover the UT signals. 



Furthermore, each cluster processor has full CSI for all the wireless links in its coverage area. 
The following subsections explain the mode of operation for each approach and describe the 
analytical derivation of the per-cell sum-rate throughput. 

A. Global Multicell Joint Decoding 

In global MJD, a central processor is able to jointly decode the signals received by neighboring 
clusters and therefore no intercluster interference takes place. In other words, the entire cellular 
system can be assumed to be comprised of a single extensive cluster. As it can be seen, this case 
serves as an upper bound to the lA case. The received ra x 1 symbol vector yj at any random 
BS can be expressed as follows: 

y,(t) = G,,,(t)x,(t) + aG,,,+i(t)xi+i(t) + Zi(t), (1) 

where the n x 1 vector z denotes AWGN with E[zi] = and IE[zizf ] = I. The Kn x 1 
vector Xj denotes the transmitted symbol vector of the ith UT group with E[xixf^] = 7I where 
7 is the transmit Signal to Noise Ratio per UT antenna. The n x Kn channel matrix Gj j ~ 
CA/'(0, I„) includes the flat fading coefficients of the zth UT group towards the zth BS modelled as 
independent identically distributed (i.i.d.) complex circularly symmetric (c.c.s.) random variables. 
Similarly, the term aGj .(+i(t)xj_|_i(t) represents the received signal at the ?th BS originating from 
the UTs of the neighboring cell indexed i + \. The scaling factor a < \ models the amount 
of received intercell interference which depends on the path loss model and the density of 
the cellular systerru. Another intuitive description of the a factor is that it models the power 
imbalance between intra-cell and inter-cell signals. 

Assuming a memoryless channel, the system channel model can be written in a vectorial form 
as follows: 

y = Hx + z, (2) 

where the aggregate channel matrix has dimensions Mn x (M -|- l)Kn and can be modelled as: 

H = S G (3) 

^For more details on the modelling of the a parameter, the reader is referred to 1291 . 



with S = S (g) InxKn being a block- Toeplitz matrix and G ~ CA/'(0, Imu)- In addition, S is a 
M X M + 1 Toeplitz matrix structured as follows: 

la •■■ 



Ola 







(4) 



■■■ la 

Assuming no CSI at the UTs, the per-cell capacity is given by the MIMO Multiple Access 
(MAC) channel capacity: 



CMJD = ^E[X(x;y|H)] 

= ^E[logdet(lMn + 7HH^)]. 



(5) 



Theorem 3.1: In the global MJD case, the per-cell capacity for asymptotically large n con- 
verges almost surely (a.s.) to the Marcenko-Pastur (MP) law with appropriate scaling [[6l, ifTOl : 



Cmjd — > KnVup I ^ ^ ^7 (1 + a j , K ^ 



(6) 



where Vmp (7,/3) = log ( 1 + 7 - ^0(7,/3) j + ^log U + 7/3 - \<P{l,P) ) - ^<^(7,/3) 



and 0(7,/3)= (^^(^l + v^j' + l-y^^l-^) 



Proof: For the sake of completeness and to facilitate latter derivations, an outline of the 
proof in ^, ifTOl is provided here. The derivation of this expression is based on an asymptotic 



analysis in the number of antennas n — )• oo: 



-Cmjd = lim -— E [logdet (Ia/„ + 7HH^)] 



lim E 

n— >oo 



POO 

= / \og{l + M^x)f^^^„{x)dx 
Jo *^" 

/■oo 

= ir / log (1 + M^x) f°^„„„{x)dx 
Jo ^'^ 

Mn 

^irVMp(g(S)M7,ir), (7) 

where Aj (X) and f^ denote the eigenvalues and the asymptotic eigenvalue probability distri- 
bution function (a.e.p.d.f.) of matrix X respectively and Vx(a;) = E[log(l + xX)] denotes the 
Shannon transform of X with scalar parameter x. It should be noted that ^ = wj denotes the total 
UT transmit power normalized by the receiver noise poweilj. The last step of the derivation is 
based on unit rank matrices decomposition and analysis on the R-transform domain, as presented 
in |[6l, ifTOl . The scaling factor 

q{i:) = \\i:f /{Mnx{M + l)Kn) (8) 



is the Frobenius norm of the S matrix ||S|| = ^ytr {S^S} normalized by the matrix dimensions 
and 

where step (a) follows from [fTOl Eq.(34)]. ■ 

B. Interference Alignment 

In order to evaluate the effect of lA as an intercluster interference mitigation technique, a 
simple precoding scheme is assumed for the cluster-edge UT groups, inspired by [24J. Let us 
assume a n x 1 unit norm reference vector v with ||v|p = n and 

yi = Gi,iXi + aGi,2X2 + zi (10) 

'For the purposes of the analysis the variable 7 is kept finite as the number of antennas Mn grows large, so that the system 
power does not grow to infinity. 
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yu = Gm.mXm + aGM,M+i^M+l + Zm, (11) 

where yi and y^ represent the received signal vectors at the first and last BS of the cluster 
respectively. The first UT group has to align its input xi towards the non-intended BS of the 
cluster on the left (see Fig. [T]), while the Mth BS has to filter our the aligned interference coming 
from the M + 1th UT group which belongs to the cluster on the right. These two strategies are 
described in detail in the following subsections: 

1) Aligned Interference Filtering: The objective is to suppress the term aGA/,Af+iXj\/+i which 
represents intercluster interference. It should be noted that UTs of the M + 1th cell are assumed 
to have perfect CSI about the channel coefficients GA/,Af+i- Let us also assume that x^ and G^ . 
represent the transmitter vector and channel matrix of the jth UT in the ith group towards the 
iih. BS. In this context, the following precoding scheme is employed to align interference: 

^M+l ~ V^M,M+l) ^J^M+li (1^) 

TT 

where v^ = wvj is a scaled version of v which satisfies the input power constraint E[x'J^^^^x'J^^^^ ] 
7I. This precoding results in unit multiplexing gain and is by no means the optimal lA scheme, 
but it serves as a tractable way of evaluating the lA performance. Following this approach, the 
intercluster interference can be expressed as: 



aGAf,Af+lXA/+l - « 2^ ^M,M+l^\l+l - « 2^ ^M,M+l {^M,M+l) ^^i^M+1 " '^^ Z^ ^j^M+V 

(13) 
It can be easily seen that interference has been aligned across the reference vector and it can be 
removed using a. K x n zero-forcing filter Q designed so that Q is a truncated unitary matrix 
lfT9l and Qv = 0. After filtering, the received signal at the Mth BS can be expressed as: 

Ym = QG^^M^M + zj\/, (14) 

Assuming that the system operates in high-SNR regime and is therefore interference limited, the 
effect of the AWGN noise colouring za/ = Qzjyj can be ignored, namely E[za/z^] = 1^. 

Lemma 3.1: The Shannon transform of the covariance matrix of QGj^^ jy^ is equivalent to that 
of a K X K Gaussian matrix Gkxk- 
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Proof: Using the property det(I + 7AB) = det(I + 7BA), it can be written that: 

det (I^ + iQGmm^^^mm)'') = det (ix + tQCa/.a/Gm^/ Q"") 

= det (I„ + 7G^,,MQ^QGAf,Af) • (15) 

The K X n truncated unitary matrix Q has K unit singular values and therefore the matrix 
product Q^Q has K unit eigenvalues and a zero eigenvalue. Applying eigenvalue decomposition 
on Q^Q, the left and right eigenvectors can be absorbed by the isotropic Gaussian matrices 
G^ j^^j and Gm,m respectively, while the zero eigenvalue removes one of the n dimensions. 
Using the definition of Shannon transform [[30l . eq. (fT5l) yields 

VqGm,m(QGm,m)«(7) = ^GK.KG»,^il)- (16) 

■ 

Based on this lemma and for the purposes of the analysis, QGj^/ ^^ is replaced by Gkxk in the 
equivalent channel matrix. 

2) Interference Alignment: The Af th BS has filtered out incoming interference from the cluster 
on the right (Figured]), but outgoing intercluster interference should be also aligned to complete 
the analysis. This affects the first UT group which should align its interference towards the Mth 
BS of the cluster on the left (Figure [I]). Following the same precoding scheme and using eq. 
Gl) 

K K 

Gi.ixi = Y, Gi,ixi = Y. Gi, (G^o,i)"' vt;,a;i, (17) 






where Gq 1 represents the fading coefficients of the jth UT of the first group towards the Mth 
BS of the neighboring cluster on the left. Since the exact eigenvalue distribution of the matrix 
product G\i{Gqi)^^yvj is not straightforward to derive, for the purposes of rate analysis it 
is approximated by a Gaussian vector with unit variance. This approximation implies that lA 
precoding does not affect the statistics of the equivalent channel towards the intended BS. 

3) Equivalent Channel Matrix: To summarize, lA has the following effects on the channel 
matrix H used for the case of global MJD. The intercluster interference originating from the M+ 
1th UT group is filtered out and thus Kn vertical dimensions are lost. During this process, one 
horizontal dimension of the Af th BS is also filtered out, since it contains the aligned interference 
from the M + 1th UT group. Finally, the first UT group has to precode in order to align its 
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interference towards the Mth BS of neighboring cluster and as a result only K out of Kn 
dimensions are preserved. The resulting channel matrix can be described as follows: 



where Gja ~ CA/'(0, Im„_i) and 



HiA — SiA GiA, 



Si 

S2 
S3 



(18) 



■TA 



(19) 



Jl 



|0( 



M-2)nxK ^M-2xM-l'< 



with El = [I„xx ainxxn 0„x(Af-2)i^n] being a nx (M-l)Kn+K matri: 

Inxi^n] being a {M -2)n x (M- lU^n + iT matrix and S3 = [0„_ix(Af-2)Xn+K In-ix/<n] being 

a n — 1 X (M — 1)A^?2 + K matri?qf • 

Since all intercluster interference has been filtered out and the effect of filter Q has been 
already incorporated in the structure of Hja, the per-cell throughput in the lA case is still given 
by the MIMO MAC expression: 



CiA = — E[X(x;y|HiA)] 

= ^E [logdet {iMn-i + 7HiaHi^)] • 



(20) 



Theorem 3.2: In the lA case, the per-cell throughput can be derived from the R- transform of 
the a.e.p.d.f. of matrix -H/^Hia- 

Proof: Following an asymptotic analysis where n —^ 00: 

-CiA = lim ^E [logdet [Imu-i + tHjaH^)] 



Mn n^oo 



Mn-1 



Mn 



Mn JO 
{M-l)Kn + K 
Mn 



^^ y log ( 

log (1 + 7x) /f^ ^« (x)rfx 



I + 7A. (-HiaHi^ 



HArtjA ' 



n lA "^^ 



(21) 



''The structure of the first block of Si originates in the Gaussian approximation of — Gj i(Gq i) ^"vvj. 
^The structure of the last block of S3 is based on Lemma [3J] 
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The a.e.p.d.f. of ^H^Hja is obtained by determining the imaginary part of the Stieltjes transform 
S for real arguments 

considering that the Stieltjes transform is derived from the R-transform [|3TI as follows 



^^u«u,M = T^^H"u,A-^) - -■ (23) 



Theorem 3.3: The R-transform of the a.e.p.d.f. of matrix -Hf^HjA is given by: 

3 

^iHf,HiA(^) = y'^iH«H,(2;,fci,A,gi) (24) 

i=l 

with k, j3,q parameters given by: 

Hi -.ki = , /3i = ^ K,qi = ^ — 

MK + M-K'^ K + l '^ K + 2 

„ , (M-l)(ir + l) ^ M-1 ^ M-2,^ , 2^ 

Ho -.ko = -, do = K, Go = (1 + a ) 

MK + M-K'' M-2'^ M-V ' 

K + 1 

H3 ±3 = , (33 = K +1,03 = 1 

^ ^ MK + M -K''^ '^^ 

and T^iH^H given by theorem IATTI 

Proof: Based on eq.(fT9l). the matrix H^Hja can be decomposed as the following sum: 

HjaHia = H^ Hi + H2 H2 + H3 H3, (25) 

where Hi = Y,iQGnx{M-l)Kn+K^ H2 = S20G(jv/-2)nx(M-l)ii'n+_ft' ^nd H3 = S30G„_ix(Af-l)ii'n+K- 

Using the property of free additive convolution [30] and Theorem lA.ll in Appendix |Al eq. (I24l) 
holds in the R-transform domain. ■ 

C. Resource Division Multiple Access 

RDMA entails that the available time or frequency resources are divided into two orthogonal 
parts assigned to cluster-edge cells in order to eliminate intercluster interference [fT6l Efficient 
isolation scheme]. While double the power can be transmitted in each orthogonal part, the 
available channel resources are cut in half for cluster-edge cells. The channel modelling is 
similar to the one in global MJD case (eq. ©), although in this case the throughput is analyzed 



14 



separately for each orthogonal part and subsequently averaged. Assuming no CSI at the UTs, 
the per-cell throughput in the RDMA case is given by: 

CrDi + Cpj,D2 



c 



RD 



1 



2M 



(E [logdet {iMn + THRD.HfoJ] + E [logdet (I(a/_i)„ + tHrd.H^dJ]) , (26) 



where Crd^ and Crdj denote the capacities when the cluster-edge UTs are active and inactive 
respectively. When the cluster-edge UTs are active, the cluster processor receives signals from 
all M BSs and the resulting Mn x MKn channel matrix is structured as follows: 

H 

HrDi 



HrDi — SrDi G 



Mn X MKn 



with Hrdi 



SrDi 



5^RDi 



and SrDi — [0„x(M-l)Xn '^InxKn], 



(27) 



where the factor 2 is due to the doubling of the transmitted power. 

Theorem 3.4: In the RDMA case with active cluster-edge UTs, the per-cell throughput Crdi 
can be derived from the R-transform of the a.e.p.d.f. of matrix -H^j-, Hrd^, where: 



T^IhH tt (z) 



{M-l){l + a') 1 

1 - (1 + a^)KMz + ^^b-b(^, ^, ^, 2) 



(28) 



Proof: Following an asymptotic analysis where n — > oo: 

-Crdi = lim T^E [logdet (iMn + THROiKf^J] 



n 



n-i-oo Mn 

r>00 



= K log(l+7x)/rHH Hh, i^)d^- 
Using the matrix decomposition of eq. (l27l) and free additive convolution |[30] 



71 tiUi rxui j^ y^ RAJ I SXLJj^ 

Eq. (128]) follows from eq. dH with g = Mg(f:) = (M - 1)(1 + a^), /3 
theorem lA.li 



(29) 



(30) 

KM/{M - 1) and 
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When the cluster-edge UTs are inactive, the cluster processor receives signals from M — 1 
BSs and the resulting M — 1 x (M — l)Kn channel matrix is structured as follows: 

H 

HrD2 



HrD2 — 5]rd2 G(M-l)nx(M-l)E'n with Hrd2 



^RD2 



SrD2 



and 5]rd2 — [0nx(M-2)Kn '^nxKn] 



(31) 



Theorem 3.5: In the RDMA case with inactive cluster-edge UTs, the per-cell throughput Crd2 
can be derived from the R-transform of the a.e.p.d.f. of matrix ^li^^Q^ii^j:,^^ where: 



{M-2){l + a^) 
^^«hb2Hkd2(^) - i-(i + a2)ir(M-l)z+^^^"^^^' 



M^'^'^^ 



(32) 



Proof: Following an asymptotic analysis where n — > oo: 

1 1 

-Crd2 = lim —-^ [logdet (l(M-i)n + 7HRD2HfD2)] 



K- 



M 



M 



log (1 + 7x) fT^H „ {x)dx. 



RD, "R.D2 



The rest of this proof follows the steps of Theorem 13.41 



(33) 



D. Cochannel Interference Allowance 

CI is considered as a worst case scenario where no signal processing is performed in order 
to mitigate intercluster interference [32|. As it can be seen, this case serves as a lower bound 
to the lA case. The channel modelling is identical with the one in global MJD case (eq. O), 
although in this case the cluster-edge UT group contribution aGAf,M+i(i^)xA/+i(t) is considered 
as interference. As a result, the interference channel matrix can be expressed as: 

^MnxKn 
OiGnxKn 



Hi 



(34) 



Assuming no CSI at the UTs, the per-cell throughput in the CI case is given by: 

Cci = Cmjd — Ci5 



(35) 
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where Ci denotes the throughput of the interfering UT group normalized by the cluster size: 

Ci = ^E[X(x;y|Hi)] 

= ^E [logdet {Imu + 7HiHf )] . (36) 



Theorem 3.6: In the CI case, the per-cell throughput converges almost surely (a.s.) to a 
difference of two scaled versions of the the MP law: 



Proof: Following an asymptotic analysis in the number of antennas n — )■ oo:: 

-Ci = lim -^E [logdet {Imu + 7HiHf )] 

= ^^^ Tl~^ [logdet [in + 7a^G„xi^nG^xXn)] 

n-j-oo Mn 

^^VMp{a^,K). (38) 

Eq. (137]) follows from eq. dMl), dMl) and Theorem O ■ 



E. Degrees of freedom 

This section focuses on comparing the degrees of freedom for each of the considered cases. 
The degrees of freedom determine the number of independent signal dimensions in the high SNR 
regime and it is also known as prelog or multiplexing gain in the literature. It is a useful metric 
in cases where interference is the main impairment and AWGN can be considered unimportant. 

Theorem 3. 7: The degrees of freedom per BS antenna for the global MJD, lA, RDMA and 
CI cases are given by: 

C^MJD = 1, C^IA = 1 - T-^, C^RD = 1 - 7777, dci= I - — • (39) 

Mn 2M M 

Proof: Eq. (|39l ) can be derived straightforwardly by counting the receive dimensions of 
the equivalent channel matrices (eq. ([3]) for global MJD, eq. (fTSi) for lA, eq. (l27l) and (|3T1) for 
RDMA, eq. (|34|) for CI) and normalizing by the number of BS antennas. ■ 
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Lemma 3.2: The following inequalities apply for the dofs of eq. (I39l) : 

C^MJD > C^IA > C^RD > C^CI- (40) 

Remark 3.1: It can be observed that diA = c^rd only for single UT per cell equipped with 
two antennas (K = l,n = 2). For all other cases, djA > ^rd- Furthermore, it is worth noting 
that when the number of UTs K and antennas n grows to infinity, limj^ „^oo c^ia = c^mjd which 
entails a multiuser gain. However, in practice the number of served UTs is limited by the number 
of antennas (n = K + 1) which can be supported at the BS- and more importantly at UT-side 
due to size limitations. 

F. Complexity Considerations 

This paragraph discusses the complexity of each scheme in terms of decoding processing and 
required CSI. In general, the complexity of MJD is exponential with the number of users [33] and 
full CSI is required at the central processor for all users which are to be decoded. This implies 
that global MJD is highly complex since all system users have to be processed at a single point. 
On the other hand, clustering approaches reduce the number of jointly-processed users and as 
a result complexity. Furthermore, CI is the least complex since no action is taken to mitigate 
intercluster interference. RDM A has an equivalent receiver complexity with CI but in addition it 
requires coordination between adjacent clusters in terms of splitting the resources. For example, 
time division would require inter-cluster synchronization, while frequency division could be even 
static. Finally, lA is the most complex since CSI towards the non-intended BS is also needed at 
the transmitter in order to align the interference. Subsequently, additional processing is needed 
at the receiver side to filter out the aligned interference. 

IV. Numerical Results 

This section presents a number of numerical results in order to illustrate the accuracy of 
the derived analytical expressions for finite dimensions and evaluate the performance of the 
aforementioned interference mitigation schemes. In the following figures, points represent values 
calculated through Monte Carlo simulations, while lines refer to curves evaluated based on the 
analytical expressions of section [nil More specifically, the simulations are performed by generat- 
ing 10^ instances of random Gaussian matrices, each one representing a single fading realization 
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of the system. In addition, the variance profile matrices are constructed deterministically based on 
the considered a factors and used to shape the variance of the i.i.d. c.c.s. elements. Subsequently, 
the per-cell capacities are evaluated by averaging over the system realizations using: a) eq. ([5]) 
for global MJD, b) eq. (HS-Gl) and ^ for lA, c) eq. ^ for RDMA, d) eq. dM]),® for 
CI. In parallel, the analytical curves are evaluated based on: a) theorem 13.11 for global MJD, 
b) theorems [Sj and [33] for lA, c) theorems [Ml and [33] for RDMA, d) theorems [3j] and [16] 
for CI. Table [I] presents an overview of the parameter values and ranges used for producing the 
numerical results of the figures. 

Firstly, figure [2] depicts the per-cell throughput versus the cluster size M for medium a factors. 
It should be noted that the a factor combines the effects of cell size and path loss exponent 
as explained in [29]. As expected the performance of global MJD does not depend on the 
cluster size, since it is supposed to be infinite. For all interference mitigation techniques, it can 
be seen that the penalty due to the clustering diminishes as the cluster size increases. Similar 
conclusions can be derived by plotting the degrees of freedom versus the cluster size M (figure 
[3]). In addition, it can be observed that the lA dofs approach the global MJD dofs as the number 
of UTs and antennas increases. Subsequently, Figure [4] depicts the per-cell throughput versus the 
a factor. For high a factors, RDMA performance converges to lA, whereas for low a factors 
RDMA performance degrades and touched the CI curve. It should be also noted that while 
the performance of global MJD and RDMA increase monotonically with a, the performances 
of lA and cochannel interference degrade for medium a factors. Finally, figure [5] depicts the 
per-cell throughput versus the number of UTs per cell K. It should be noted that the number of 
antennas per UT n scale jointly with K. Based on this observation, a superlinear scaling of the 
performance can be observed, resulting primarily from the increase of spatial dimensions (more 
antennas) and secondarily from the increase of the system power (more UTs). As it can be seen, 
the slope of the linear scaling is affected by the selected interference mitigation technique. 

V. Conclusion 

In this paper, various techniques for mitigating intercluster interference in clustered MJD 
were investigated. The case of global MJD was initially considered as an upper bound, serving 
in evaluating the degradation due to intercluster interference. Subsequently, the lA scheme 
was analyzed by deriving the asymptotic eigenvalue distribution of the channel covariance 
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matrix using free-probabilistic arguments. In addition, the RDMA scheme was studied as a 
low complexity method for mitigating intercluster interference. Finally, the CI was considered 
as a worst-case scenario where no interference mitigation techniques is employed. Based on these 
investigations it was established, that for dense cellular systems the RDMA scheme should be 
used as the best compromise between complexity and performance. For average to sparse cellular 
systems which is the usual regime in macrocell deployments, lA should be employed when the 
additional complexity and availability of CSI at transmitter side can be afforded. Alternatively, 
CI could be preferred especially for highly sparse cellular systems. 

Appendix A 
Proof of Theorem 

Theorem A.l: Let A = [0 B 0] be the concatenation of the variance-profiled Gaussian matrix 
B = C G and a number of zero columns. Let also k be the ratio of non-zero to total columns 
of A, /3 be the ratio of horizontal to vertical dimensions of B and q the Frobenius norm of C 
normalized by the matrix dimensions. The R-transform of A^A is given by: 

k - zqk{(3 + 1) ± v^F (g2/32^2 -2q(3z-2 z'^q^(3 + l-2qz + z^q^ + 4 zqk) 



TZij^HA{z,k,P,q) 



2z {ql3z - k) 

(41) 



Proof: Let B = C G„xm be a variance-profiled Gaussian matrix with (3 = m/n and 
q = WCW^/nm. According to [[TOl . the R-transform of -B^B is given by: 

'^iB»B(^) = ^. (42) 

Using eq. (|23l ). the Stieltjes transform of -B^B can be expressed as: 

-z + q-qP±^z^~2zq-2zqP + q^ - 2 g^/? + g2^2 
'^^b^bI^) = ^^ ■ (43) 

Matrix -A^A has identical eigenvalues to -B^B plus a number of zero eigevalues with < 
k < 1 defined as the ratio of non-zero eigenvalues over the total number of eigenvalues. As a 
result, the a.e.p.d.f. of -A^A can be written as: 

/iaha(^) = kh^H^iz) + (1 - k)5{x). (44) 
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Using the definition of the Stieltjes transform [|30ll : 



1 — k 
<Siji^Hji^{z) = kSi^H^iz) (45) 



and employing eq. (|23l) . the proof is complete. 
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( Useful MID Signal 
Vlntercluster Interference 




Cluster of M cells 



Fig. 1. Graphical representation of the considered cellular system modelled as a modified version of Wyner's model. K UTs are 
positioned between each pair of neighboring BSs with path loss coefficients 1 and a respectively. All BSs and UTs are equipped 
with n = K + 1 antennas. The UTs positioned within the box shall be jointly processed. The red links denote intercluster 
interference. 
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Fig. 2. Per-cell throughput scaling versus the cluster size M. The performance of global MJD does not depend on the cluster 
size, while for all interference mitigation techniques, the penalty due to the clustering diminishes as the cluster size increases. 

TABLE I 

Parameters for throughput results 



Parameter 


Symbol 


Value 


Range 


Figure 


Cluster size 


M 


4 


3-10 


m 


a factor 


a 


0.5 


0.1-1 


m 


UTs per cell 


K 


5 


2-10 


m 


Antennas per UT 


n 


4 


3-11 


m 


UT Transmit Power 


7 


20dB 






Number of MC iterations 




10=* 
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Fig. 3. Degrees of freedom versus the cluster size AI. The lA dofs approach the global MJD dofs as the number of UTs and 
antennas increases. 
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Fig. 4. Per-cell throughput scaling versus the a factor. For high a factors, RDMA performance converges to interference 
alignment, whereas for low a factors RDMA performance degrades even beyond the cochannel interference bound. While 
the performance of global MJD and RDMA increase monotonically with a, the performances of interference alignment and 
cochannel interference degrade for medium a factors. 
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Fig. 5. Per-cell throughput scaling vs. the number of UTs per cell K. A superlinear scaling of the performance can be observed, 
resulting primarily from the increase of spatial dimensions (more antennas) and secondarily from the increase of the system 
power (more UTs). The slope of the linear scaling is affected by the selected interference mitigation technique. 



